Subspace Information Criterion for Infinite Dimensional Hypothesis Spaces

Authors

  • Masashi Sugiyama
  • Klaus-Robert Müller
Abstract

A central problem in learning is to select an appropriate model. This is typically done by estimating the unknown generalization errors of a set of candidate models and then choosing the model with the minimal generalization error estimate. In this article, we discuss the problem of model selection and generalization error estimation in the context of kernel regression models, e.g., kernel ridge regression, kernel subset regression, or Gaussian process regression. Previously, a non-asymptotic generalization error estimator called the subspace information criterion (SIC) was proposed, which could be successfully applied to finite dimensional subspace models. SIC is an unbiased estimator of the generalization error for the finite sample case under the condition that the learning target function belongs to a specified reproducing kernel Hilbert space (RKHS) H with dim H less than the number M of training samples. In this paper, we extend the range of applicability of SIC and show that even if dim H > M, SIC is an unbiased estimator of an essential part of the generalization error. Our extension allows the use of infinite dimensional RKHSs, i.e., richer function classes commonly used in Gaussian processes, support vector machines, or boosting. We further show that when dim H > M, SIC can be expressed in a much simpler form, making its computation highly efficient. In computer simulations on ridge parameter selection with real and artificial data sets, SIC compares favorably with other standard model selection techniques, for instance leave-one-out cross-validation or an empirical Bayesian method.
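The selection procedure described in the abstract (estimate the generalization error of each candidate model, then pick the minimizer) can be illustrated with a minimal sketch. The abstract does not give the SIC formula, so this sketch substitutes leave-one-out cross-validation, one of the baseline techniques the abstract names, as the error estimate for Gaussian-kernel ridge regression. All function names, the kernel width, the candidate ridge values, and the synthetic data are assumptions made for illustration only.

```python
import math
import random


def gaussian_kernel(x, y, width=1.0):
    """Gaussian (RBF) kernel on scalars; the width is an illustrative choice."""
    return math.exp(-((x - y) ** 2) / (2.0 * width ** 2))


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def fit_krr(xs, ys, lam, width=1.0):
    """Kernel ridge regression: solve (K + lam * I) alpha = y."""
    n = len(xs)
    K = [[gaussian_kernel(xs[i], xs[j], width) + (lam if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    return solve(K, ys)


def predict(xs, alpha, x, width=1.0):
    """Evaluate the learned function sum_i alpha_i k(x_i, x)."""
    return sum(a * gaussian_kernel(xi, x, width) for a, xi in zip(alpha, xs))


def loocv_error(xs, ys, lam, width=1.0):
    """Leave-one-out estimate of the squared generalization error (not SIC)."""
    total = 0.0
    for i in range(len(xs)):
        xs_tr = xs[:i] + xs[i + 1:]
        ys_tr = ys[:i] + ys[i + 1:]
        alpha = fit_krr(xs_tr, ys_tr, lam, width)
        total += (predict(xs_tr, alpha, xs[i], width) - ys[i]) ** 2
    return total / len(xs)


def select_ridge(xs, ys, candidates, width=1.0):
    """Return the candidate ridge parameter with the smallest error estimate."""
    scores = {lam: loocv_error(xs, ys, lam, width) for lam in candidates}
    return min(scores, key=scores.get), scores


# Synthetic data (an assumption): noisy samples of sin(x) on [-3, 3].
rng = random.Random(0)
xs = [rng.uniform(-3.0, 3.0) for _ in range(30)]
ys = [math.sin(x) + rng.gauss(0.0, 0.2) for x in xs]

candidates = [1e-3, 1e-2, 1e-1, 1.0, 10.0]
best_lam, scores = select_ridge(xs, ys, candidates)
print("selected ridge parameter:", best_lam)
```

Replacing `loocv_error` with another generalization error estimator, such as SIC as defined in the full paper, would leave the selection loop in `select_ridge` unchanged.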


Similar resources

Selecting Ridge Parameters in Infinite Dimensional Hypothesis Spaces

Previously, an unbiased estimator of the generalization error called the subspace information criterion (SIC) was proposed for a finite dimensional reproducing kernel Hilbert space (RKHS). In this paper, we extend SIC so that it can be applied to any RKHSs including infinite dimensional ones. Computer simulations show that the extended SIC works well in ridge parameter selection.


Subspace-diskcyclic sequences of linear operators

A sequence $\{T_n\}_{n=1}^{\infty}$ of bounded linear operators on a separable infinite dimensional Hilbert space $\mathcal{H}$ is called subspace-diskcyclic with respect to the closed subspace $M \subseteq \mathcal{H}$ if there exists a vector $x \in \mathcal{H}$ such that the disk-scaled orbit $\{\alpha T_n x : n \in \mathbb{N}, \alpha \in \mathbb{C}, |\alpha| \leq 1\} \cap M$ is dense in $M$. The goal of t...


The Subspace Information Criterion for Infinite Dimensional Hypothesis Spaces

A central problem in learning is selection of an appropriate model. This is typically done by estimating the unknown generalization errors of a set of models to be selected from and then choosing the model with minimal generalization error estimate. In this article, we discuss the problem of model selection and generalization error estimation in the context of kernel regression models, e.g., ke...


Total subspaces in dual Banach spaces which are not norming over any infinite dimensional subspace

The main result: the dual of a separable Banach space X contains a total subspace which is not norming over any infinite dimensional subspace of X if and only if X has a nonquasireflexive quotient space with the strictly singular quotient mapping. Let X be a Banach space and X* be its dua...


About Subspace-Frequently Hypercyclic Operators

In this paper, we introduce subspace-frequently hypercyclic operators. We show that these operators are subspace-hypercyclic and that there are subspace-hypercyclic operators that are not subspace-frequently hypercyclic. There is a criterion, similar to the subspace-hypercyclicity criterion, that implies subspace-frequent hypercyclicity, and if an operator $T$ satisfies this criterion, then $T \oplus T$ is sub...




Publication date: 2001